Temporal Opinion Spam Detection by Multivariate Indicative Signals

Authors

  • Junting Ye
  • Santhosh Kumar
  • Leman Akoglu
Abstract

Online consumer reviews reflect the testimonials of real people, unlike advertisements. As such, they have a critical impact on potential consumers, and indirectly on businesses. According to a Harvard study (Luca 2011), a one-star rise in rating increases revenue by 5–9%. Problematically, such financial incentives have created a market for spammers to fabricate reviews that unjustly promote or demote businesses, an activity known as opinion spam (Jindal and Liu 2008). The vast majority of existing work on this problem relies on formulations based on static review data, with the respective techniques operating offline. Spam campaigns, however, are intended to have the most impact while they are under way. Abnormal events triggered by spammers’ activities can be masked by the load of later events, so static analysis would fail to identify them. In this work, we approach the opinion spam problem with a temporal formulation. Specifically, we monitor a list of carefully selected indicative signals of opinion spam over time and design efficient techniques to both detect and characterize abnormal events in real time. Experiments on datasets from two different review sites show that our approach is fast, effective, and practical to deploy in real-world systems.

Similar resources

Analyzing and Detecting Opinion Spam on a Large-scale Dataset via Temporal and Spatial Patterns

Although opinion spam (or fake review) detection has attracted significant research attention in recent years, the problem is far from solved. One key reason is that there is no large-scale ground truth labeled dataset available for model building. Some review hosting sites such as Yelp.com and Dianping.com have built fake review filtering systems to ensure the quality of their reviews, but the...

Towards Accurate Deceptive Opinion Spam Detection based on Word Order-preserving CNN

As a mainly network of Internet naval activities, deceptive opinion spam is of great harm. Identifying deceptive opinion spam is of great importance because of the rapid and dramatic development of the Internet. Effectively distinguishing between positive and deceptive opinions plays an important role in maintaining and improving the Internet environment. Deceptive opinion spam is very ...

An approach for detecting spam in Arabic opinion reviews

For the rapidly increasing amount of information available on the Internet, little quality control exists, especially over user-generated content. Manually scanning through large amounts of user-generated content is time-consuming and sometimes impossible. In this case, opinion mining is a better alternative. Although it is recognized that the opinion reviews contain valuable information fo...

Deceptive Review Spam Detection via Exploiting Task Relatedness and Unlabeled Data

Existing work on detecting deceptive reviews primarily focuses on feature engineering and applies off-the-shelf supervised classification algorithms to the problem. Then, one real challenge would be to manually recognize plentiful ground truth spam review data for model building, which is rather difficult and often requires domain expertise in practice. In this paper, we propose to exploit the ...

Automatic detection of deceptive opinions using automatically identified specific details

Distinguishing deceptive opinions — that is, fabricated views disguised to be genuine — from honest opinions is a hard problem. Deceptive opinions can include things like the false expression of a controversial opinion, a misleading review of an item or service bought online, or deceitful interviews. Unlike many tasks involving language, detecting deceptive opinions through text alone turns out...

Publication date: 2016